Goto

Collaborating Authors

 similar article


Harnessing PubMed User Query Logs for Post Hoc Explanations of Recommended Similar Articles

arXiv.org Artificial Intelligence

Searching for a related article based on a reference article is an integral part of scientific research. PubMed, like many academic search engines, has a "similar articles" feature that recommends articles relevant to the current article viewed by a user. Explaining recommended items can be of great utility to users, particularly in the literature search process. With more than a million biomedical papers being published each year, explaining the recommended similar articles would facilitate researchers and clinicians in searching for related articles. Nonetheless, the majority of current literature recommendation systems lack explanations for their suggestions. We employ a post hoc approach to explaining recommendations by identifying relevant tokens in the titles of similar articles. Our major contribution is building PubCLogs by repurposing 5.6 million pairs of coclicked articles from PubMed's user query logs. Using our PubCLogs dataset, we train the Highlight Similar Article Title (HSAT), a transformer-based model designed to select the most relevant parts of the title of a similar article, based on the title and abstract of a seed article. HSAT demonstrates strong performance in our empirical evaluations, achieving an F1 score of 91.72 percent on the PubCLogs test set, considerably outperforming several baselines including BM25 (70.62), MPNet (67.11), MedCPT (62.22), GPT-3.5 (46.00), and GPT-4 (64.89). Additional evaluations on a separate, manually annotated test set further verifies HSAT's performance. Moreover, participants of our user study indicate a preference for HSAT, due to its superior balance between conciseness and comprehensiveness. Our study suggests that repurposing user query logs of academic search engines can be a promising way to train state-of-the-art models for explaining literature recommendation.


CoLe and LYS at BioASQ MESINESP8 Task: similarity based descriptor assignment in Spanish

arXiv.org Artificial Intelligence

In this paper, we describe our participation in the mesinesp Task of the BioASQ biomedical semantic indexing challenge. The participating system follows an approach based solely on conventional information retrieval tools. We have evaluated various alternatives for extracting index terms from IBECS/LILACS documents in order to be stored in an Apache Lucene index. Those indexed representations are queried using the contents of the article to be annotated and a ranked list of candidate labels is created from the retrieved documents. We also have evaluated a sort of limited Label Powerset approach which creates meta-labels joining pairs of DeCS labels with high co-occurrence scores, and an alternative method based on label profile matching. Results obtained in official runs seem to confirm the suitability of this approach for languages like Spanish.


Scientific Documents Similarity Search With Deep Learning Using Transformers (SciBERT)

#artificialintelligence

Let have a look at some random articles. Here we limit the printing to the first hundred words, because some of them are very long. This step aims to vectorize the articles' abstract text so that we can perform the similarity analysis. Since we are dealing with the scientific documents, we will use SciBERT, which is a pre-trained language model for Scientific text data. You can find more information about it on Semantic Scholar.


Supercharge Knowledge Management With Help From AI

#artificialintelligence

In his 1999 book Management Challenges for the 21 Century, Austrian-born American management consultant, professor, and author Peter Drucker wrote of the importance of "the coordination and exploitation of organizations' knowledge resources, in order to create benefit and competitive advantage." Today, businesses have embraced his point, demonstrating how maintaining and growing an organization's information to assist its employees and customers offer those benefits and advantage. As a practice, this collecting and sharing of information is referred to as knowledge management. Even prior to Drucker's observation, the Consortium for Service Innovation had already begun its work in 1992 on Knowledge-Centered Service (previously known as Knowledge-Centered Support) or KCS *. KCS is a method that focuses on organizational knowledge as a key asset that can benefit, among other things, customer service delivery.


Product age based demand forecast model for fashion retail

arXiv.org Machine Learning

Fashion retailers require accurate demand forecasts for the next season, almost a year in advance, for demand management and supply chain planning purposes. Accurate forecasts are important to ensure retailers' profitability and to reduce environmental damage caused by disposal of unsold inventory. It is challenging because most products are new in a season and have short life cycles, huge sales variations and long lead-times. In this paper, we present a novel product age based forecast model, where product age refers to the number of weeks since its launch, and show that it outperforms existing models. We demonstrate the robust performance of the approach through real world use case of a multinational fashion retailer having over 300 stores, 35k items and around 40 categories. The main contributions of this work include unique and significant feature engineering for product attribute values, accurate demand forecast 6-12 months in advance and extending our approach to recommend product launch time for the next season. We use our fashion assortment optimization model to produce list and quantity of items to be listed in a store for the next season that maximizes total revenue and satisfies business constraints. We found a revenue uplift of 41% from our framework in comparison to the retailer's plan. We also compare our forecast results with the current methods and show that it outperforms existing models. Our framework leads to better ordering, inventory planning, assortment planning and overall increase in profit for the retailer's supply chain.


Named Entity Recognition: Applications and Use Cases

#artificialintelligence

Named Entity Recognition is a process where an algorithm takes a string of text (sentence or paragraph) as input and identifies relevant nouns (people, places, and organizations) that are mentioned in that string. In our previous blog, we gave you a glimpse of how our Named Entity Recognition API works under the hood. In this post, we list some scenarios and use cases of Named Entity Recognition technology. News and publishing houses generate large amounts of online content on a daily basis and managing them correctly is very important to get the most use of each article. Named Entity Recognition can automatically scan entire articles and reveal which are the major people, organizations, and places discussed in them.